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Abstract — This paper studies the problem of content 
distribution in wireless peer-to-peer networks where 
all nodes are selfish and non-cooperative. We propose 
a model that considers both the broadcast nature of 
wireless channels and the incentives of nodes, where 
each node aims to increase its own download rate and 
reduce its upload rate through the course of content 
distribution. We then propose a protocol for these 
selfish nodes to exchange contents. Our protocol is dis- 
tributed and does not require the exchange of money, 
reputation, etc., and hence can be easily implemented 
without additional infrastructure. Moreover, we show 
that our protocol can be easily modified to employ 
network coding. 

The performance of our protocol is studied. We 
derive a closed-form expression of Nash Equilibriums 
when there are only two files in the system. The prices 
of anarchy, both from each node's perspective and 
the whole system's perspective, are also characterized. 
Moreover, we propose a distributed mechanism where 
each node adjusts its strategies only based on local 
information and show that the mechanism converges 
to a Nash Equilibrium. We also introduce an approach 
for calculating Nash Equilibriums for systems that 
incorporate network coding when there are more than 
two files. 

I. Introduction 

It is known that using wireless peer-to-peer (P2P) 
networks to distribute information locally can im- 
prove the system performance in many different 
aspects. For example, in cellular networks, nearby 
mobile devices can exchange data with each other 
instead of contacting the faraway base station for 
the data. Since exchanging data with nearby devices 
requires much less power and results in less interfer- 
ence to other devices, such an approach may reduce 
power consumption and increase spatial reuse. 

Many existing studies (e.g., |Q]|-|13]|) have demon- 
strated the benefits of wireless P2P networks. How- 
ever, these studies have assumed that all nodes 
are cooperative. In practice, nodes may be selfish. 
Therefore, a major challenge for P2P networks is 
to provide incentives to nodes in the network so 



that they are willing to contribute to the network 
by transmitting data that they possess. While there 
are a lot of studies, such as |]4]|-|l6]], on this topic 
for P2P networks over Internet, these works cannot 
be applied to wireless P2P networks. Due to the 
broadcast nature of wireless transmissions, when 
a node transmits a packet, all nodes within the 
proximity are able to receive the packet. Therefore, 
in wireless P2P networks, data exchange involves all 
nodes within the system, rather than only two nodes 
as in P2P networks over Internet. 

In this paper, we study wireless P2P networks 
composed of selfish nodes. We first provide a model 
that considers the broadcast nature of wireless trans- 
missions and the incentives of selfish nodes. Each 
node in the system aims to increase its download 
rate and decrease its upload rate, so as to reduce 
its own power consumption. We then propose a 
protocol for content distribution in these wireless 
P2P networks. Our protocol does not require the 
exchange of money, reputation, etc., and hence can 
be implemented without the need of additional in- 
frastructure. This non-monetary feature further dis- 
tinguishes our work from other studies that rely on 
additional infrastructure to set prices or payoffs [6]- 
J8), or to punish uncooperative nodes [9]. Moreover, 
our protocol can be easily modified to employ net- 
work coding. 

The performance of our protocol is studied. When 
there are only two files in the system, we show that 
there are closed-form expressions for each node's 
strategies under a series of Nash Equilibriums. We 
also derive the prices of anarchy under these Nash 
Equilibriums, both from a node's selfish perspective 
and the whole system's perspective. 

To compute its strategy under a Nash Equilibrium, 
a node needs information of all other nodes, which 
is not always available to the node. To address 
this challenge, we propose a distributed mechanism 
where each node updates its strategy only based on 
its private information and the history of the system. 
We show that this distributed mechanism converges 
to a Nash Equilibrium. Moreover, this mechanism 
is also consistent with each node's incentive, as 



the expected cost of each node reduces with each 
update. 

We then consider systems that have more than 
two files and employ network coding. We propose 
a systematic approach to compute the Nash Equilib- 
riums. The performance of such systems are further 
investigated through numerical studies. 

The rest of the paper is organized as follows. 
Section [II] proposes our system model and protocol 
for content distribution. Section [III] studies the Nash 
Equilibriums for systems with only two files. Section 
IPvl studies the prices of anarchy under these Nash 
Equilibriums. Section |V| discusses implementation 
issues and provides a distributed mechanism for 
nodes to update their strategies. Section |Vl] studies 
the Nash Equilibriums when there are more than two 
files in systems that employ network coding. Section 
IVIII p rovides some numerical results. Finally, Section 
IVIIlf concludes the paper. 

II. System Model and Protocol Overview 

Consider a system with a number of wireless 
nodes that share a set of files or video/audio 
streams. We assume that each file, or stream, is 
composed of infinitely many packets. This assump- 
tion is natural when nodes share a set of streams. 
In scenarios where nodes share a set of files, this 
assumption is a good approximation when the size 
of each file is at least several MBytes, or, consists 
of at least thousands of packets. We denote the set 
of nodes by N, and the set of files, or streams, by 
X = {A, B, C, . . . }. We assume that each node needs 
to download one file from other peers, and holds all 
packets of all the other files. One way to enforce this 
assumption is by requiring each node to download 
all but one files through, say, its own 3G connection 
before admitting the node into the system. We use 
X n to denote the file that client needs. For example, 
if X n = B, then node n needs to download B from 
its peers, and have all data of files A, C, D, . . . . In 
this paper, we use upper-case letters to represent 
files and lower-case letters to represent nodes. When 
the file that a node needs is not important in the 
context, we use n,m, etc., to denote a node. On 
the other hand, when we want to stress that the 
indicated node needs a certain file, say, file A, we 
use a-i to denote the node. 

Most protocols for peer-to-peer content distribu- 
tion involves the data exchange between two peers. 
However, those protocols may not be directly ap- 
plicable to wireless broadcast networks. In wireless 
networks, all transmissions are carried by the shared 
wireless medium. We assume that, when a node 
transmits a packet from a file, all nodes receive 
this packet. The broadcasting nature of wireless 
transmissions may result in a "free rider" problem 
as depicted in the following example. 

Example 1: Consider a system with four nodes 
and two files where X\ = X2 = A and X% = X4 = 
B. Suppose that node 1 and node 3 exchange data, 



that is, node 1 transmits packets of B, and node 
3 transmits packets of A in return. As all nodes 
can receive all transmissions, node 2 and node 4 
can obtain packets of A and B without making any 
transmissions. Therefore, we say that node 2 and 
node 4 are free riders. 

In addition to being unfair, the presence of free 
riders may prevent nodes from transmitting data 
and contributing to the networks when all nodes are 
selfish. In the above example, each node may refrain 
from transmitting data, in the hope that other nodes 
participate in exchanging data, making itself a free 
rider. 

In this paper, we propose a non-monetary protocol 
for P2P content distribution and study its perfor- 
mance when all nodes are selfish. Before introducing 
the protocol, we first formally describe the goal of 
each node. 

We define the time needed for transmitting a data 
packet to be one unit time. We assume that the goal 
of a node is to increase its download rate and to 
reduce the number of transmissions it makes. To 
be more specific, we assume that whenever node n 
transmits a data packet, it needs to pay a transmis- 
sion cost of g n . The transmission cost can be chosen 
to, for example, reflect the amount of power needed 
for making a transmission. On the other hand, node 
n pays a waiting cost at rate w n per unit time through 
the course of the protocol. Suppose that, a node 
n receives two needed packets at time t\ and t%, 
respectively. During time [ii^a], node n transmits 
a total number of a n packets. Thus, to download 
the packet at time ti, node n waits a total amount 
of t2 — h time, and makes a n transmissions. The 
total cost for the packet that node n downloads at 
ti can then be expressed as a n g n + (*2 — t\)w n . The 
average total cost of node n is then defined as the 
long-term average total cost per downloaded packet. 
The following example explains the computation of 
average total cost. 

Example 2: Suppose a node n downloads a total 
number of d n (t) packets that it needs, and transmits 
u n (t) packets, during time [0,t]. The average total 

cost of node n is then limsup^^ — • 

We assume that the goal of node n is to minimize 
its average total cost. Node n wishes to reduce 
lim sup^^ in order to achieve small transmis- 
sion cost. On the other hand, the download rate of 
node n can be expressed as liminf t ^oo ^p-, which 
is the inverse of limsup t ^ oc Therefore, to have 
high download rate, node n also wishes to have 
small limsupt..^ 

We now describe our protocol for P2P content 
distribution. The process is composed of rounds, 
where two nodes transmit two packets that each 
other needs in each round. At the beginning of a 
round, each node n secretly and randomly picks a 
backoff timer, t n . Node n then waits and listens to 
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Fig. 1: An example of a round. 



the channel for t n time. If no transmissions take 
place in t n time, node n transmits a control packet 
that contains its value of X n . This control packet is 
interpreted as a promise from node n saying that 
if any node transmits a packet of X n , node n will 
respond with a packet that n has. As the length 
of this control packet is much smaller than a data 
packet, we assume that time needed for transmitting 
the control packet is negligible and node n does not 
pay transmission cost for the control packet. After 
node n transmits the control packet, every node m 
that has the file X n secretly and randomly picks a 
backoff timer i m . Node m waits and listens to the 
channel for i m time. If no other nodes transmit in 
i m time, node m transmits a data packet of X n , and 
piggybacks its value of X m . Upon receiving the data 
packet from node m, node n responds with a packet 
of X m , as promised in its control packet. The round 
ends after node n responds the packet and a new 
round begins. 

As explained in the previous paragraph, there are 
two phases of backoffs in a round, one for a node to 
transmit a control packet, and the other for a node 
to respond to the control packet by transmitting a 
data packet. We call the two phases of backoffs the 
initiation phase and the response phase, respectively. 
Figure [T] shows an example of a round. 

Intuitively, when a node n chooses a large value of 
t n and i n , it is likely that node n does not transmit, 
which increases its chance of being a free rider and 
reduces its transmission cost. However, a large value 
of t n and i n also means that node n may need to 
wait a long time before receiving a packet it needs, 
which results in large waiting cost. By taking waiting 
costs into account, our protocol encourages nodes 
to choose reasonable small value of t n and i n , and 
hence enables data exchange among nodes. We also 
note that this protocol is non-monetary and can be 
easily implemented for modern wireless networks 
without the need of additional infrastructure. 

Finally, we show that our protocol can be mod- 
ified to incorporate network coding. The modified 

Sxitocol is very similar to the one shown in Fig. 
The only difference is that, in the last part of a 
round, when node n sends a data packet, it sends 



a coded packet that contains one packet of each file 
that it has. For example, suppose that there are three 
files A, B, C in the system, and X n = A, then client 
n sends a data packet containing (one packet of 
B) + (one packet of C) after it receives a packet of A 
from m. Since we assume that every node possesses 
all but one files, the coded packet sent by n can be 
decoded by every client that needs either file B or 
file C. By using network coding, we guarantee that 
every client in the system obtains one packet in each 
round. Also, since clients that possess neither file B 
nor file C cannot decode the packet sent by n, this 
approach also prevents clients that need more than 
one files from joining the system. 

A. Performance Evaluation 

Under our protocol, each selfish node's strategy 
consists of two parts: choosing a distribution to gen- 
erate its backoff timer in the initiation phase, t n , and 
choosing a distribution to generate its backoff timer 
in the response phase, i n . We say that the strategies 
of all nodes in the system form a Nash Equilibrium 
if, for each node n, its strategy minimizes its average 
total cost, given the strategies of all other nodes. 

In the following sections, we analyze the per- 
formance of our protocol under Nash Equilibriums. 
We consider the performance from both nodes' per- 
spectives and the system's perspective. A node's 
performance is simply based on its average total 
cost. On the other hand, we consider the system's 
performance by its per-node average throughput, 
defined as the average download rate over all clients. 
We also define the price of anarchy on node cost and 
price of anarchy on system throughput as follows: 

Definition 1 : The price of anarchy on node cost of 
a node n under a Nash Equilibrium is the ratio of the 
average total cost of n under the Nash Equilibrium 
and the minimum possible average total cost of n 
when all nodes are cooperative. 

Definition 2: The price of anarchy on system 
throughput under a Nash Equilibrium is defined as 

(maximum possible per-node throughput) 
(per-node throughput under the Nash Equilibrium) ' 

III. Nash Equilibriums for Bilateral File 
Exchanges 

In this section, we analyze the performance of 
our protocol when there are only two files in the 
system. In such a system, we can divide nodes 
into two groups, nodes in one group, indexed by 
ai, 02, . . • , ai, have the file B and need the file A, 
that is, X a% = A, while nodes in the other group, 
indexed by b\,b2,...,bj have Xf, = B. In this 
setting, network coding is not employed. 

We will show that there is a series of Nash Equilib- 
riums where each node n chooses each of its backoff 
timers in the initiation phase and response phase 
as an exponential random variable. We focus on 



exponential random variables due to its memoryless 
property, which makes it easily implementable. 

Our analysis is based on the following theorem: 

Theorem 1: IfTOl Lemma 3.1] The strategies of 
nodes in a game form a Nash Equilibrium if, for 
each node n, given the strategies of other nodes, 
the expected cost of node n is the same regardless 
of its chosen values of backoff timers. 

We first consider the nodes' strategies on choosing 
the backoff timers in the response phase, i n . Without 
loss of generality, we assume that node b\ has sent 
the control packet in the initiation phase. Each 
node di secretly and randomly chooses a backoff 
timer t a .. Assume that the timer chosen by node 
di* is the smallest among all backoff timers, that is, 
t ai * = min{t ai , i a2 , . . . }. Node aj« will transmit after 
time i a . m , and the additional waiting time, which is 
the length of the response phase, for each node a, 
to download a packet is i ai , . In the expression of the 
additional waiting time, we exclude the time needed 
waiting for 61 to transmit its control packet and the 
time needed for transmitting data packets, as these 
times are not influenced by the values of i a . . In the 
following, we also exclude the waiting cost incurred 
by the time waiting for 61 to transmit its control 
packet and the time needed for transmitting data 
packets when discussing additional waiting cost and 
additional total cost. 

We show that there is a Nash Equilibrium where 
each node m chooses its backoff timer as an expo- 
nential random variable. In particular, we assume 
that i a% ~ EXP(\ a .), where the value of A aj will be 
determined in the sequel. 

We now apply Theorem [T] to determine the values 
of \ a% . Suppose that node chooses i ar _ = t. If t 
is smaller than all i ai , i ^ i*, that is, t < min^. i a ., 
node aj« needs to transmit a packet after waiting 
for t units of time, and thus the additional total cost 
for node a,» on this packet is g a . t + w a . t t. On the 
other hand, if t > min^* i a ., a node other than aj» 
transmits the packet, and node <2j* becomes the free 
rider. In this case, the additional total cost of node 
<2j* on this packet is w a ., (min^. {i a% }). Note that we 

have min^»{4j ~ EXP (J2i^ A <J- Let : = 
Yli^i* ■ The expected additional total cost of node 
i* can be written as: 



w a „s\-i*e x ~** s ds 



s=0 



/oo 
(g ai , +w ai ,t)\-i*e- x -*' s ds 

)• (1) 



Nash Equilibrium. This can be done by choosing 



w 



'a k 



w 



9a 



9a, 



(2) 



where I is the number of nodes in group 
{a±, <22, . . . }. We can also conclude that the expected 
value of mirij{i ai }, which is the expected duration of 
the response phase, given that a node in the group 
{61, 62, ... } transmits the control packet, is 



I -I 



(3) 



and that the expected additional total cost of node 

<H is g a% ■ 

Similarly, if a node a% transmits a control packet, 
each node bj selects 4 ~ EXP^^), where 



9b h 9b, 



The expected duration of the response phase is 



1 



J -I 



y.j 'V<, y.j 9b 



(4) 



(5) 



and the expected additional total cost of node bj is 

9b r 

Next, we consider the choice of backoff timer in 
the initiation phase, that is, the choice of t n for a 
node n. We will show that there is a Nash Equilib- 
rium where each node a, selects t a . ~ EXP(^ az ) 
and each node bj selects t\ )j ~ EXP^.). 

Assume that a node, say, node a^, selects t a ., = t. 
If t is the smallest timer among all timers, that is, t < 
mirij^j* t ai and t < mmjt^, node ai* transmits the 
control packet after time t. After which time, it needs 
to wait one of the nodes in {bi, 62, • • • } to respond 
with a data packet, and then a,i* needs to transmit 
a data packet. By the analysis above, we know that 
the expected time that waits for one of the nodes 
in {bi, 62, • • • } to respond is Tb- Thus, the expected 
total cost for node ctj» is g a .„ + w a ., (t + Tb) + 2iu a ., , 
where the last term, 2w ai , , accounts for the waiting 
cost caused by transmission delays, as it takes two 
units time to transmit two data packets. 

Next, consider the case that t > 
min{miiij^j. t a . , mirij tj, . }. We have that min^. t a . ~ 
EXPi^i^ll) and min,^. ~ EXP&j 76, )• By 
the memoryless property of exponential functions, 
we have that 

P A\{i'}<B 

■=Prob{mint a < mini&.|i > min{mini a .,minib }} 

i^i* 1 j 1 i^i* 1 j 3 



Thus, if g ai = for all i, the strategies form a 



That is, with probability Pa\{i*}<b> one °f the nodes 
in {ax, 0,2, • • • } other than transmits the control 
packet, and ,with probability 1 — Pa\{i*}<b> one 
of the nodes in {61,62,...} transmits the control 
packet. If it is the former case, node does not 
need to transmit any packets, and its expected 
cost is u) a .„ (min{min i ^j» t ai ,minj t b] } + T B + 2). 
If it is the later case, the expected cost is 
w a .„ (min{min^j. t a% , min, t bj } + 2) + g a% _ , since 
we have shown that the expected additional 
total cost of Oj« is g a ., . Hence, given that 
t > min{mirij^j» t ai , min.,- t b . }, the expected 
cost is u> a .» (min{minj^j. i a . , ruin, t b] } + 2) + 

PA\{i*}<B w a z *T B + (1 - PA\{i'}<B)9a i , ■ 

As we have minlmin^. t a . , mirij t b } ~ 
EXP&i^la* + Ei76,), by 1 letting 
1-v '■= E;^*7a, + Ej76 3 , the expected cost 
of cii* can be computed as: 



s=0 



[w ai , (s + 2) + P A \ {i » }<B w az *f B ]"/-i*e 7 - 4 * s ds 



+ I [(1 - P A \{i*}<B)9 ai *h-i-e 7 -*'' 

7s=0 



(<?a,. + (* + f B ) + 2u; ai ,)7_ 4 .e- 7 -*^s 



's=i 

<r„ . Ei^i* 7a, 



H — ' ?B + ^ " ffa,,. + 2w a ., 



+e 



7-i« 7-i* 
E j lb, 



7-i* 7-i* 7-i* 



(6) 

We wish to find {7^,7^, • • • , 7&i>76 2 , • • • } so that 
the expected cost of a,, is the same for all t. Hence, 
we require that 

(1 - PA\{i^}<B) w a i *T B + PA\{i*}<B9 ai , = — (7) 

7— t* 



i i^i* 
W a ,, _ Ei^j* 7ai 



W„ 



(8) 



(9) 



9a,. l-(Ei76 3 .)T B 

for all aj». Similarly, by studying the expected cost 
of a node we also require that 

"•'< • _ Sj¥i*7b, 



l-(Ei7a,)T A 



(10) 



for all 



Summing the ([9]) over all a,, and we have 



4^ 9a, ~ 1 



U -l)E,7a, 



(11) 



^(^Ta,)^ + (J>,)^ = 1. (12) 



Assume that (E i 7a,)r j4 = a and (E^)^ = 
1 - q, for some a G (0, 1). Using © and (flTT) . we 
obtain 



7a,* = 7a * ~ Y 7a * 
i i^j* 



Q' 



Ei^Wfi-a, 



W„ 



I -I 



9a z * 



Similarly, we also obtain 

E., 



7 6 = (1 - a)(- 



«>6 



J - 1 



96,- 



(13) 



(14) 



It is easy to check that, for every a e (0,1), setting 
j a . and j b . according to (TT3T) and ( fT4l) satisfies Q 
and (fTOl) for all nodes, and the expected cost of each 
node is the same regardless the actual backoff timer 
it chooses. Hence, (1131) and (fT4l) form a Nash Equi- 
librium. Further, as a can be any number in (0, 1), 
this game has infinitely many Nash Equilibriums. 

We summarize our results for systems with only 
two files in the following theorem. 

Theorem 2: For any a € (0, 1), if each node en* 
chooses t ai , ~ EXP(j at ,) and i a% , ~ EXP{\ ai ,), 
and each node bj* chooses t bj , ~ EXP(j3 b .„ ) and 
i bj , ~ EXP(X b ,l), where 7„., , \ a .„ , -y bj , , and A 6 .» 
are chosen by ([13]), ©, Q~|]), and ©, then these 
strategies form a Nash Equilibrium. 

IV. Price of Anarchy for Bilateral File 
Exchanges 

We have found Nash Equilibriums for our protocol 
when there are only two files in the system. We now 
discuss the performances of these Nash Equilibriums. 

Suppose all nodes choose their backoff 
timers according to Theorem [H for some 
a G (0, 1). The duration of the initiation phase is 
min{t ai ,t a2 , . . . ,t bl ,t b2 , . . .}, which is an exponential 
random variable with mean 



Ta,b ■'- 



1 



Ei 7a, + E j 76j 



(15) 



Also, the probability that one of the nodes in group 
{ax, a,2, • • • } transmits the control packet is 



E'7a- 

Pro6{mint ai < mint;, } = ^ —. ^ . 

< 3 Ei 7a, + Ej lb, 



(16) 



Using © and ©, we can express the expected 
amount of time for the two groups of nodes to 
exchange two packets as 



T A , B + 



Ei7a, 



Ei 7a, + Ej 

f A + f B + 2, 



T B + 



E,-7 0j 



E t 7a, + Ej 76, 



T A + 2 



(17) 



where the last term in the equation accounts for 
the time needed for transmitting two packets. Since 
every node downloads a packet in each round, the 



per-node average throughput of our protocol is then 
1/{Ta + Tb + 2) packets per unit time under the 
described Nash Equilibriums. On the other hand, 
the download rate of a client is at most 0.5 packet 
per unit time, and the maximum possible per-node 
throughput is 0.5. Therefore, the price of anarchy 
on system throughput is (T A + Tb + 2)/2. To better 
understand the price of anarchy on system through- 
put, we consider the special case where all nodes 
have the same parameters for waiting cost and for 
transmission cost, that is, w n = w and g n = g for 
all n. In this case, we have f A = and 
rp B _ (J-i)jg/w) ^ anc [ t ] ie p r i ce f anarchy on system 
throughput is (ItDjaM + V-VjaM + 2 )/2 < 1 + £ . 

We now compute the average total costs of nodes. 
The probability that a node transmits a packet in 
a round can be expressed as 

Prob{mmt a . < min£&. }Prob{t ai , < t a ,\/i ^ i*} 

i ' j 1 

+ Prob{mmt a . > mint^.}Pro6{t a . t < t a .,\/i ^ i*} 



" I -I 



The average total cost of node is then 



w at , (T A + f B + 2) 
i yai* \ 



J - 1 

(J - iK, 



_ W a% , y /52i w ai/9ai 
9a v * I ~ 1 

+ 2w ai , . 



(18) 



On the other hand, under our protocol, the down- 
load rate of node ctj» is at most one packet per 2 unit 
times. Hence, the average total cost of node is at 
least 2w ai , . We then have that the price of anarchy 

on node cost of a** is at most + 1-2) /2. 

In the special case where w n = w and g n = g, for all 
nodes,, the price of anarchy on node cost of cij* is at 
most (A + gy + 2 )/2<l + A 

We note that both the price of anarchy on system 
throughput and the price of anarchy on node cost 
increase with Intuitively, when g is small com- 
pared to w, nodes focus more on improving their 
download rate than on reducing transmission cost. 
Hence, nodes tend to choose small backoff timers 
and result in small price of anarchy. On the other 
hand, when g is much larger than w, transmission 
costs become an important factor of nodes' costs. 
Hence, each node tends to choose large backoff 
timers to increase its chance of becoming a free rider, 
which results in large price of anarchy. 

As a final remark, we note that both the sys- 
tem throughput and total average costs of nodes 
remain the same for all a £ (0,1). Therefore, the 
performance of the system is the same for all Nash 
Equilibriums described by Theorem [2j 



V. Implementation Issues and Convergence 

Section [III] has described a Nash Equilibrium for 
a system with two files. However, for a node n to 
derive its strategies, that is, to compute the values 
of 7„ and A n , node n needs to know information of 
the whole network, including the private values of 
w m and g m for all other nodes m. In this section, 
we propose a distributed mechanism for each node 
to update its values of 7„ and A n only based on its 
values of w n and g n and the history of the system. We 
show that this mechanism is compatible to the node's 
incentive, in the sense that the updated 7 n and 
A ra achieve smaller average total cost for the node. 
Moreover, we also show that the system converges 
to a Nash Equilibrium when all nodes apply this 
mechanism. 

We order the two flies by lexicographical order. If 
file A has higher order than B, we impose that j az = 
0, for all i, and Aj,. = 0, for all j. This corresponds 
to the case where a = in Section [TTTJ Therefore, 
in every round, a node in {61,62,...} transmits a 
control packet in the initiation phase and a node in 
{ai, G&2, . . . } transmits a data packet in the response 
phase. On the other hand, if file B has higher order 
than A, we impose that A a . = 0, for all i, and 7^ = 0, 
for all j, which corresponds to the case where a = 1. 
Without loss of generality, we assume that file A has 
higher order than B. 

Using ©, ©, ([T4j , we have that f A - 1 - 
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, and 7k, 



J 3 — at the Nash Equilibrium, where I and J 

T B 9b r 

are the number of nodes in groups {01,02, . . . } and 
{61, 62, . . . }, respectively. As we set a = 0, T A is the 
average backoff time in the response phase, and Tb 
is that in the initiation phase. 

We now introduce our mechanism for a node a,. . 
Node aj. first guesses that the average amount of 
backoff time in the response phase is T a ., t o, and 
sets A,, .,, = ^ £1 . Node a«. then observes the 

system behaviors and updates its value of A ai , every 
M rounds. When updates \ a ., the k th time, it 
computes the average backoff time in the response 
phase since it last updates its values, denoted by 
T A k-i- Node ai* then sets T a „ & so that 
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'-(H*,k-1 



4(- 
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<-A±-l 



and A a „ = ^ tt^j where the values of 5k are 

chosen so that YlT=i ^ k = 00 an< ^ Efcli $1 < °°- 
For example, one can choose 6k = |, where e is a 
small constant. The mechanism for a node bj* can 
be derived similarly. The only difference is that node 
bj* updates its value of 7^, based on the average 
backoff time in the initiation phase. 



As described in the following theorems, this mech- 
anism has two important features. First, this mecha- 
nism is compatible to the node's incentive. Second, 
this mechanism converges to a Nash Equilibrium. 

Theorem 3: Fix the values of j n and A n , for all 
n / aj«, such that j a . = 0, for all i, and Xf, j = 0, for 
all j. The average total cost of node a,, is smaller 
when it sets A fli » = -~r-^ -r^ 1 , than when it sets 



— , for all k. 



Proof: Since we impose 7 a4 , = 0, node 
has no control on the amount of backoff time in 
the initiation phase. Hence, it suffices to show that 



setting X ai , 



achieves smaller average 



T ai , ,fc+l 9a 

additional total cost in the response phase than 
setting \ a% . 



9a it 



We have T A , 
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XW A. 4 .+(, 
... A„ . . 
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and fir a ., 



a,i», t. Moreover, if g ai , 



rM- By ©, if 
> 0, the expected additional cost 

of Oj» strictly decreases with the backoff timer of 

Wai * x — > 0, we have 

f ai ,,k > T A,k and T a% , )k < f a ^, tk+1 . For every 
positive constant C, Prob(t < C) is smaller when 
V. = 7?r^ than when A a ., = 

T a .,, k+1 9a,, a -* T a .,, k 

Therefore, the average additional total cost 

- than when 



is smaller when A aj 



i 



9a , 



On the other hand, if g a . 



< 0, the 



expected additional cost of aj» strictly increases 
with the backoff timer of di», t. Moreover, if g a ., — 

Y2™.,'\a. t > °> we have Ta,*,k < f a .^ k+ i, and, for ev- 
ery positive constant C, Prob(t < C) is larger when 



than when \ a .» 



T a% ,, k+1 9a,, ^" T^.,* 9a,, 

Therefore, the average additional total cost is also 
smaller when A a .» = ■^-^ — — than when 



Aa " ~ j— -ia~:- ■ 

Theorem 4: Fix the values of 7„ and A„, for all 
n / such that 7 . = 0, for all z, and Xf,. = 0, 
for all j. The average total cost of node bj* becomes 
smaller when it updates its 75.. . 

Proof: The proof is very similar to that of The- 
orem [3l and is hence omitted. ■ 

Next, we show that our mechanism converges to 
the Nash Equilibrium. 

Theorem 5: If all nodes apply the proposed mech- 
anism, then the value of T4 k and Tg k after each 



node updates k times converge to Ta 
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and 
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X, 



■, respectively, as k — >■ 00. 



Proof: We only prove that converges to T^. 
Under our mechanism, the value of \ a% after node 
<ij updates k times is 
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VI. Nash Equilibriums for Multiple File 
Exchanges with Network Coding 

In this section, we derive the Nash Equilibriums 
for systems that incorporate network coding and 
have more than two files. We use Ia,Ib,Ic, ■ ■ ■ to 
denote the number of nodes that need A,B,C, . . . , 
respectively. We will show that there exist a Nash 
Equilibrium where each node n chooses each of 
its backoff timers in both phases as an exponential 
random variable. A lot of derivations in this section 
are similar to those in Section [TTTJ Hence, we omit 
some details and only report the major results in this 
section due to space limit. 

We first consider the nodes' strategies in the re- 
sponse phase. Assume that a node n with X n = X 
has sent the control packet in the initiation phase. 
Each node m with X m ^ X secretly and randomly 
chooses a backoff timer t m ~ EXP(\ m \ x )- Note that 
the value of \ m \x may depend on X. Let uiq be the 
node that chooses the smallest value of t^. Then, 
mo will transmit a data packet of X after i mo time. 
After mo transmits the data packet, n will transmit a 
coded packet that contains one packet from each of 
the files except X, and every node m with X m ^ X 
can decode one packet from the transmission from 
n. Similar to the derivations of ©, we can show 
that, at a Nash Equilibrium, we have 
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(19) 



for all m such that X m ^ X. 

Next, we consider the nodes' strategies in the 
initiation phase. Assume that each node n chooses 
a backoff timer t n ~ EXP(j n ). Let Tx ■= 
E n -x =x7n- Similar to the derivations of ([9]), (HOP , 
and (Il2l) . we have that, at a Nash Equilibrium: 
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for all X. (I21D represents a series of linear equations 
where both the number of unknowns, {Tx}, and 
the number of equations equal to the number of 
files. We can use standard techniques for solving 
linear equations to obtain a solution of {Tx} to 
(l2TT) . We can then use {Tx} to obtain the values of 
{j n } through (|20l) . The derived {^ n } forms a Nash 
Equilibrium. 

VII. Numerical Results 

We now present our simulation results. We first 
consider a system with two files, A and B, and 
20 nodes. We assume that there are 10 nodes that 
possess file A and need file B, and the other 10 
nodes possess B and need A. We set g n = 1 for 
all n, and w n is uniformly distributed in [1, 2]. Each 
node applies the distributed mechanism introduced 
in Section LY| to update its strategy. Each node n sets 
f nfi = l00 ^l/ 9n ) > tnat is > il: guesses that there are 
100 nodes in its group, an overestimate by a factor of 
10, and all nodes in its group have the same values of 
w n and g n as itself. Also, each node sets 8k = 0.1/ k. 

Fig. [2] shows the resulting per-node throughput 
after each update. It can be shown that, even though 
the initial strategies of nodes are far from the 
Nash Equilibrium, the per-node throughput under 
our mechanism converges to the Nash Equilibrium 
very quickly. With just three updates, the per-node 
throughput is about 85% of that under the Nash 
Equilibrium. 

Next, we consider systems that have more than 
two files and employ network coding. We assume 
that, for each file, there are ten nodes that need 
it. Therefore, there are a total number of 10 x 
{number of files} nodes in the system. We also set 
g n = 1 for all nodes, and w n uniformly distributed in 
[1,2]. We use the procedure described in Section fVIl 
to derive the values of j n and A n for all n, based on 
which we calculate the per-node throughput under 
a Nash Equilibrium. 

Fig. [3] shows the per-node throughput under vari- 
ous numbers of files in the system. We compare the 
performance of our protocol against the maximum 
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Fig. 2: Per-node throughput with two files. 
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Fig. 3: Per-node throughput with multiple files and 
network coding. 



possible per-node throughput when network coding 
is not employed and when all nodes are cooperative. 
Without network coding, each transmission only 
contains one packet of one file. Hence, the maximum 
possible per-node throughput is l/{number of files}. 
Fig. [3] shows that, when there are only three files in 
the system, our protocol has slightly worse per-node 
throughput than the case when network coding is 
not employed and all nodes are cooperative. This is 
because our protocol considers the selfish behaviors 
of nodes and the times spent on the two phases 
of backoffs, while the compared scenario assumes 
all nodes are cooperative and hence there is no 
time spent on backoffs. However, as the number 
of files increases, the benefits of network coding 
outweigh the prices of anarchy. As a result, our 
protocol achieves better per-node throughput than 
the scenario where network coding is not employed 
and all nodes are cooperative. 

VIII. Conclusions 

We have discussed the problem of content distri- 
bution over wireless P2P networks. We have pro- 
posed a model that considers both the broadcast 



nature of wireless transmissions and the incentives 
of nodes. Based on the mode, a protocol for con- 
tent distribution has been designed. The protocol 
encourages nodes to contribute to the network and is 
non-monetary. We have studied the performance of 
our protocol when all nodes are selfish. For systems 
with only two files, we have demonstrated that there 
are closed-form expressions for Nash Equilibriums 
and prices of anarchy. We have also proposed a 
distributed mechanism where all nodes update their 
strategies only based on their respective private 
information and the history of the system. We have 
provided numerical results that show that this mech- 
anism converges to Nash Equilibriums very quickly. 
For systems with more than two files, we propose 
a simple extension of our protocol to incorporate 
network coding. We then describe a procedure to 
compute each node's strategy under a Nash Equilib- 
rium. Numerical results show that our protocol may 
achieve better performance than scenarios where 
nodes are cooperative but do not employ network 
coding. 
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